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(54) Recording medium, data recording unit and data playback unit reading from and writing to 
the recording medium 



(57) The present invention provides a recording me- 
dium and a recording/playback unit for use with the re- 
cording medium. The recording medium comprises a 
still image data area (102) capable of storing a plurality 
of still image data (VOB) pieces therein and an area 



(102) storing still image set management information 
(VOBSI) therein for managing a part or the whole of the 
stilt image data (VOB) in the still image data area as one 
still image set (VOBS). Each still image set (VOBS) has 
the corresponding still image set management informa- 
tion (VOBSI). 
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Description 

[0001] The present invention relates to a recording 
medium to and from which digital data may be written 
and read, to a recording unit recording digital data on it, s 
and to a playback unit playing back digital data from it. 
Particularly, the present invention relates to an optical 
disk on which multimedia data, such as video data, still 
image data, and audio data, may be recorded and to a 
recording unit and playback unit. 10 
[0002] A phase-change disk DVD-RAM (Digital Ver- 
satile Disc-RAM) with a capacity of several GB (Giga 
Bytes) has been introduced into the field of writable op- 
tical disks with a maximum capacity of about 650MB 
(Mega Bytes). As MPEG (MPEG2), the standard for is 
coding digital AV (Audio and Video) data, is employed 
for practical use, DVD-RAM is now expected for use not 
only on computers but also as recording and playback 
media in the AV field. That is : it is predicted that DVD- 
RAMs will become media replacing magnetic tapes 20 
which have been used as standard AV recording media. 
[0003] (Description of DVD-RAM) Recently as the re- 
cording density of a rewritable optical disk increases, not 
only computer data or audio data but also image data 
may be recorded on the optical disk. For example, on 2s 
the signal-recording surface of an opticat disk, the guide 
grooves in the form of projection and ditch have been 
provided conventionally. 

[0004] In former days, signals were recorded only in 
the projection or the ditch positions. The introduction of 30 
the land-groove recording method has made it possible 
for signals be recorded in both the projection and the 
ditch positions. This method has achieved about twice 
as high density as before (For example, Japanese Pat- 
ent Laid-Open Application No. JP-A-8-7282). 35 
[0005] The CLV method (Constant Line Velocity) effi- 
ciently increases the recording density. A method such 
as the zone CLV method which makes the CLV method 
easier to control and implement was also devised and 
put into practical use (For example, Japanese Patent 40 
Laid-Open Application No. JP-A-7-93873). 
[0006] One of major problems with an optical disk with 
an ever-increasing capacity is how to record AV data, 
including image data, and how to implement perform- 
ance and new functions much higher than those of con- 45 
ventional AV equipment. 

[0007] With the advent of this large-capacity, rewrita- 
ble optical disk, it is expected that tapes which have 
been used in most cases for AV data recording and play- 
back will be replaced by optical disks. A shift in recording so 
media from tapes to disks will have various influences 
on the function and performance of AV equipment. 
[0008] One of the most prominent advantages of the 
shift to disks is a great improvement in the random ac- 
cess performance. An attempt to make a random ac- ss 
cess to data on a tape involves rewinding one volume 
of tape which will usually take the order of minutes. This 
is much larger than the seek time (several ten milli-sec- 



ond or less) of optical disk media. Thus, the tape cannot 
be used practically as a random access device. 
[0009] This random access performance of an optical 
disk makes possible the distributed recording of AV data 
which would be impossible on conventional tapes. 
[0010] FIG. 1 is a block diagram showing the drive of 
a DVD recorder. In the figure, reference numeral 11 is 
an optical pickup which reads data from the disk, 12 is 
an ECC (error correcting code) processor, 13 is a track 
buffer, 14 is a switch switching input/output of the track 
buffer, 15 is an encoder, 16 is a decoder, and 17 is the 
enlarged view of a recording area on the disk. 
[0011] As shown in 17, the minimum unit of data re- 
corded on the DVD-RAM disk is 1 sector=2KB. The ECC 
processor 12 performs error correction processing on 
16 sectors = 1 ECC block. 

[0012] The track buffer shown by 1 3 is a buffer used 
to record AVdata at variable bit rates to efficiently record 
AV data on the DVD-RAM disk. This buffer acts as a 
buffer to resolve the difference between the DVD-RAM 
read/write rate (Va in the figure) which is constant the 
and the AV data bit rate (Vb in the figure) which varies 
according to the complexity of the contents (such as im- 
age data of video). 

[0013] More efficient use of this track buffer 1 3 allows 
AV data to be distributed on the disk. This is described 
below using FIGS. 2A and 2B. 

[0014] FIG. 2A is a diagram showing the address 
space of the disk. As shown in FIG. 2A, when AV data 
is recorded in separate contiguous areas [al, a2] and 
[a3, a4], supplying data, stored in the track buffer, to the 
decoder during the seek operation from a2 to a3 allows 
AV data to be played back continuously. FIG. 2B shows 
how data is accumulated into, and supplied from, the 
track buffer. 

[0015] AV data, which is read starting from at, is input 
into, and output to, the track buffer beginning at time tl. 
The amount of data corresponding to the difference in 
rate (Va - Vb) between the track buffer input rate (Va) 
and the track buffer output rate (Vb) is accumulated in 
the track buffer. This condition continues until data at a2 
is read (timet2). The amount of data B(t2), accumulated 
up to this time, is used as data that is supplied to the 
decoder until time t3 at which reading starts at a3 ar- 
rives. 

[0016] In other words, it the amount of data ([al , a2]) 
accumulated before the seek operation is equal to or 
larger than a sufficient amount, AV data may be supplied 
continuously even if the seek operation happens. 
[0017] In the above example, data is read, or played 
back, from a DVD-RAM. The example also applies when 
data is written, or recorded, onto the DVD-RAM. 
[0018] As described above, if the data exceeding a 
sufficient amount is contiguously recorded on the DVD- 
RAM, continuous playback/recording is possible even if 
AV data is distributed on the disk. 
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(Description of MPEG) 

[0019] Next, AV data is described. 

[0020] As described above, AV data recorded on a 

DVD-RAM uses the international standard called MPEG 

(ISO/IEC13813). 

[0021] A DVD-RAM, with a large capacity of several 
GB, is not large enough to store non-compressed digital 
AV data. This means that AV data must be compressed 
before being recorded. One of the popular methods for 
compressing AV data is MPEG (I SO/I EC 13818). A re- 
cent advance in the LSI technology makes it possible to 
implement an MPEG codec (compression/decompres- 
sion LSI chip), allowing the DVD recorder to MPEG- 
compress/ decompress data. 

[0022] For highly efficient data compression, MPEG 
has the following two major characteristics: 
[0023] The first characteristic is that, in addition to the 
conventional compression method using the spatial fre- 
quency characteristics, MPEG uses a compression 
method using inter-frame time correlation characteris- 
tics for compressing video data. To compress data, 
MPEG classifies frames (also called pictures in MPEG) 
into three: I picture (intra -frame coded picture), P picture 
(picture using intra-frame coding and a reference to the 
preceding picture), and B picture (picture using intra- 
frame coding and a reference to the preceding and fol- 
lowing pictures). 

[0024] FIG. 3 shows the relation among I, R and B 
pictures. As shown in FIG. 3, the P picture refers to the 
immediately preceding I or P picture, while the B picture 
refers to the immediately preceding and following I or P 
picture. Also, because the B picture refers to the follow- 
ing I or P picture, the display order of pictures does not 
always match that (coding order) of compressed data 
as shown in FIG. 3. 

[0025] The second characteristic is that MPEG allo- 
cates an amount of coding dynamically to each picture 
depending upon the complexity of the image. The 
MPEG decoder has an input buffer and accumulates da- 
ta in this decoder buffer, making it possible to allocate 
a large amount of code to a complex image which is dif- 
ficult to compress. 

[0026] Audio data used on a DVD-RAM may be se- 
lected from the following three: MPEG audio data and 
Dolby digital data (AC-3) which are compressed and 
LPCM data which is not compressed. The bit rate of Dol- 
by digital data and LPCM data is fixed. The size of 
MPEG audio data may be selected from several sizes 
in units of audio frames which are not so large as video 
streams. 

[0027] This AV data is multiplexed into one stream us- 
ing a method called a MPEG system. FIG. 4 is a diagram 
showing the configuration of the MPEG system. The ref- 
erence numeral 41 is a pack header, 42 is a packet 
header, and 43 is a payload. The MPEG system has a 
hierarchical structure consisting of packs and packets. 
A packet is composed of the packet header 42 and the 
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payload 43. AV data, divided into several pieces each 
in an appropriate size, is stored in the payload 43 be- 
ginning at its head. The packet header 42 contains in- 
formation on the AV data stored in the payload 43; it con- 

s tains the ID (stream ID) identifying the stored data as 
well as the decoding time DTS (Decoding Time Stamp) 
with precision in 90 kHz and display time PTS (Presen- 
tation Time Stamp) of the data included in the payload 
(For data such as audio data which is decoded and dis- 

10 played almost at the same time, the DTS is omitted). A 
pack is a unit composed of a plurality of packets. Since 
one pack is used for one packet for DVD-RAM, a pack 
is composed of the pack header 41 and a packet (packet 
header 42 and payload 43). In the pack header is re- 

*5 corded the SCR (System Clock Reference) which is the 
27MHz-precision time at which data in the pack is input 
into the decoder buffer. 

[0028] A MPEG system stream like this is recorded 
on the DVD-RAM, one pack per one sector (=2048 
20 bytes). 

[0029] Next, the decoder decoding the above-de- 
scribed MPEG system stream is described. FIG. 5 
shows the decoder model (P-STD) of the MPEG system 
decoder. The refernce numeral 51 is an STC (System 

25 Time Clock) measuring the standard time used in the 
decoder, 52 is a de-multiplexer which decodes, or de- 
multiplexes, a system stream, 53 is an input buffer of 
the video decoder, 54 is a video decoder, 55 is a re-order 
buffer in which I and P pictures are stored temporarily 

30 to adjust trie-difference between the data order and the 
display order of I pictures and P pictures described 
above, 56 is a switch adjusting the output order of the I 
pictures and P pictures stored in the re-order buffer, 57 
is an input buffer of the audio decoder, and 58 is an audio 

35 decoder. 

[0030] The system decoder having this configuration 
processes the above -described MPEG system stream 
as described below. When the time of the STC 51 
matches the SCR described in the pack header, the de- 

40 multiplexer 52 receives the pack. The de-multiplexer 52 
interprets the stream ID contained in the packet header 
and transfers the streams of data in the payload to the 
decoder buffer 53 or 57 for each stream. The de-multi- 
plexer 52 also gets the PTS and DTS from the packet 

45 header. When the time of the STC 51 matches the DTS, 
the video decoder 54 gets picture data from the video 
buffer 53, decodes it, stores the I and P pictures in the 
re-order buffer 55, and displays the B pictures. When 
the picture the video decoder 54 decodes is an I or P 

50 picture, the switch 56 is switched to the output terminal 
of the re-order buffer 55 to output the preceding I or P 
picture from the re-order buffer 55; when the picture the 
video decoder 54 decodes is a B picture, the switch 56 
is switched to the output terminal of the video decoder 

55 54. Like the video decoder 54, when the time of the STC 
51 matches the PTS (there is no DTS for audio data), 
the audio decoder 58 gets one frame of audio data from 
the input buffer 57 and decodes it. 



3 



EP 0 965 991 A1 



5 

[0031] Next, the multiplexing method of an MPEG 
stream is described with reference to FIG 6. FIG. 6(a) 
shows video frames, FIG. 6(b) shows the video buffer. 
FIG. 6(c) shows an MPEG system stream, and FIG. 6 
(d) shows audio data. The horizontal axis, common to 
all figures, is the time axis. Data in each figure is drawn 
based on this time axis. In the figure showing the video 
buffer status, the vertical axis indicates the buffer occu- 
pancy (amount of data accumulated in the video buffer) 
with the bold line indicating the chronological change in 
the buffer occupancy. The slope of the bold line corre- 
sponds to the bit rate, indicating that data ts input into 
the buffer at a constant rate. A reduction in the buffer 
occupancy at a regular interval indicates that data is de- 
coded at that time. The intersection of the dotted oac- 
onal line and the time axis indicates the time a: wnich 
the transfer of video Irames to the video buffer is stance 
[0032] The following describes the operation with 
complex video data image A as an example. As shown 
in FIG. 6(b), the data of image A must be transferred lo 
the video buffer at time t1 that is earlier than the decode 
time (The time from the data input time t1 to the decode 
time is called vbv_delay) because image A requires a 
large amount of code. As a result, the AV data is multi- 
plexed in the position of the video pack indicated by the 
shaded area in FIG. 6(c). On the other hand, audio data, 
which does not require dynamic coding amount control 
as with video data, need not be transferred earlier than 
the decode time; in most cases, audio data is multi- 
plexed some time earlier than the decode time. There- 
fore, for video data and audio data that are played back 
at the same time, the video data is multiplexed before 
the audio data. It should be noted that, for MPEG, all 
data except still-image data must be output from the 
buffer to the decoder within one second. This means 
that the maximum difference in the multiplexing time be- 
tween video data and that of audio data is one second 
(Strictly speaking, the time needed for re-ordering video 
data may be added to the maximum time). 
[0033] In this example, although video data is multi- 
plexed before audio data, audio data may be multi- 
plexed before video data theoretically. When highly- 
compressed, easy-to-process video data is prepared 
and the audio data is transferred much earlier, it is pos- 
sible to create such data. However, because of the lim- 
itation of MPEG described above, audio data may be 
transferred not earlier than one second. 

(Description of digital still camera) 

[0034] Next, a digital still camera is described. 
[0035] Recently, digital still cameras using JPEG 
(ISO/IEC 10918-1) have become popular. The popular- 
ity of digital still cameras is due to the fact that personal 
computers have rapidly come into wide use recently. Im- 
ages taken by digital still cameras may be easily cap- 
tured into personal computers via semiconductor mem- 
ory, floppy disks, infrared light communication, and so 



forth. The still images captured into personal computers 
may be used in presentation software products, word 
processors, and internet contents. 
[0036] In addition, digital still cameras capable of cap- 
5 turing sounds have become used. The capability of re- 
cording sounds has given digital still cameras another 
advantage over conventional film cameras. 
[0037] FIG. 7 shows the relation between JPEG data 
recorded by a digital still camera and the directories and 
io files on a PC (personal computer). 

[0038] As shown in FIG. 7, JPEG data is recorded in 
one file (with the extension code of "JPG"). When the 
number of files exceeds a predetermined number and it 
becomes difficult for the user to manage the files, they 
'5 are usually organized into the directory structure, each 
directory including about 100 tiles as shown in FIG. 7. 
[0039] However, the number of still images that can 
be recorded by a digital still camera is limited by the re- 
cording capacity of the recording media such as flash 
20 memory or floppy disks. A large number of still images 
cannot be recorded. For example, when still images, 
each 50KB in size : are recorded in the 100MB flash 
memory, the maximum number of still images that may 
be recorded at a time is as small as about 2,000 still 
25 pictures. 

(Description of digital VCR) 

[0040] Next, a digital VCR, in particular, a DVC which 
30 has rapidly become popular recently, is described. 

[0041] The introduction of the DVC has implemented 
new functions not provided on the conventional VCR. 
One of them is a recording in which video and still im- 
ages are mixed. 
3$ [0042] FIG. 8 is a diagram showing how the DVC 
records video and still images. 

[0043] As shown in FIG. 8, the DVC allows video and 
still images to be mixed in a sequential order on tape, 
allows video and still images to be alternately recorded, 
40 or allows still images to be recorded continuously just 
as they would on an album. 

[0044] However, the DVC, which is a tape medium, 
tacks random accessibility. In addition, it has no man- 
agement information similar to that used on the compu- 
ten making it difficult for the user to play back a particular 
still image the user wishes. 

[0045] The introduction of the DVD-RAM means a po- 
tential new AV equipment which solves the problem of 
limited number of still images of digital cameras and the 
so problem of random accessibility of the DVC and which 
enables the user to process tens of thousands of still 
images freely. 

[0046] As described above, the DVD-RAM is expect- 
ed as one of the next-generation AV recording media. 
55 The present invention solves the following problems 
which prevent the performance of the DVD-RAM from 
being maximized. The present invention also enables a 
DVD recorder to be implemented. The DVD recorder is 
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thought of as the intended and most important applica- 
tion of the rewritable large -capacity optical disk DVD- 
RAM. 

[0047] The most serious problem of processing a 
large amount of still image data on the DVD recorder is 
that the amount of management information is extreme- 
ly large. 

[0048] The still image data management information 
is described with reference to FIG. 9. 
[0049] Access to still image data recorded on the disk 
requires information such as the address and the size 
of data the user is going to access. 
[0050] In addition, the addition of sound data as on a 
digital still camera requires information not only on the 
address and the size but also on the playback time of 
the sound data. Post-recording, which is recorded sep- 
arately after still image data is recorded, also requires 
post-recording audio data management information. 
[0051] Access to the 4.7 GB data area, one sector at 
a lime (1 sector = 2048 bytes), requires 4 bytes for the 
address, 1 byte for still image data, and 2 bytes for 
sound data: in addition, for sound data, another 2 bytes 
is required for the playback time. The post-recording of 
sounds requires twice as large management informa- 
tion, with the total management information area being 
21 bytes in size. 

[0052] If 65000 still images are recorded and 21 bytes 
of management information is used for each still image, 
the size of the management information is calculated as: 

65000 x 21 bytes = 1365000 bytes 

The total of about 1 4 MB of management information 
is required. 

[0053] Although 1 .4 MB of data is small as compared 
with the DVD recording capacity, the system controller 
(corresponds to the CPU ot a PC) must always have this 
data in memory for use in random access. Despite a sig- 
nificant drop in the price of memory, it is unusual for AV 
equipment to have memory larger than one MB. And, it 
is impractical for AV equipment to have a battery backup 
for backing up the memory, larger than one MB, against 
an emergency. 

[0054] In a first aspect, the present invention provides 
a recording medium which minimizes the storage area 
for data management information to allow the recording 
area to be used efficiently, a recording unit which 
records data on the recording medium, and a playback 
unit which plays back data from the recording medium. 
In a second aspect, a recording medium according to 
the present invention comprises a still data image area 
(102) in which a plurality of still image data (VOB) pieces 
may be recorded and an area (102) in which still image 
set management information (VOBSI), managing the 
still image data (VOB) in a part and the whole of the still 
image data area as a gathering still image set (VOBS), 
is recorded. The still image set (VOBS) has correspond- 
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ing still image set management information (VOBSI). 

BRIEF DESCRIPTION OF THE DRAWINGS 

s [0055] Fig. 1 is a block diagram of a DVD recorder 
drive unit. 

[0056] Fig. 2A is a diagram showing the address 
space on a disk. 

[0057] Fig. 2B is a diagram showing the accumulation 

io amount of data in the track buffer. 

[0058] Fig. 3 is a diagram showing the relation of pic- 
tures in an MPEG video data stream. 
[0059] Fig. 4 is a diagram showing the configuration 
of the MPEG system stream. 

15 [0060] Fig. 5 is a block diagram of an MPEG system 
decoder (P-STD). 

[0061] Fig. 6(a) is a diagram showing video data, FIG. 
6(b) is a diagram showing a video buffer, FIG. 6(c) is a 
diagram showing an MPEG system stream, and FIG. 6 
20 (d) is a diagram showing audio data. 

[0062] Fig. 7 is a diagram showing the still image man- 
agement method in a digital still camera. 
[0063] Fig. 8 is a diagram showing the recording sta- 
tus of video and still images of a digital VTR. 
25 [0064] Fig. 9 is a diagram showing the configuration 
of still image management information. 
[0065] Fig. 10(a) is a diagram showing the directory 
structure, and FIG. 1 0(b) is a diagram showing the phys- 
ical arrangement on a disk. 
30 [0066] Fig. 1 1 A is a diagram showing management in- 
formation data. 

[0067] Fig. 11 B is a diagram showing stream data. 
[0068] Fig. 1 2 is a diagram showing the configuration 
of still image set management information 
35 [0069] Fig. 1 3 is a diagram showing a link relation be- 
tween still images and audio data. 
[0070] Fig. 14 is a flowchart showing how to deter- 
mine a still image address and how to check whether or 
not audio data is present. 
40 [0071] Fig. 15 is a block diagram of a DVD recorder/ 
player. 

[0072] Fig. 16 is an diagram showing an example of 
a still image enable flag. 

45 DETAILED DESCRIPTION OF THE EMBODIMENTS 

[0073] The present invention will be described more 
in detail using a DVD recorder and a DVD-RAM which 
are one embodiment of the present invention. In the de- 

so scription of the embodiment, the term "player" some- 
times includes the function of a player as well. 
[0074] (Logical configuration of a DVD-RAM) First, 
the logical configuration of a DVD -RAM will be de- 
scribed with reference to FIG. 10. FIG. 10(a) shows the 

55 configuration of data on the disk viewed from the file sys- 
tem, and FIG. 10(b) shows the physical sector address 
on the disk. The physical sector address begins with the 
lead-in area 1 00 where reference signals or other media 
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identification signals necessary to stabilize the servo are 
recorded. The lead-in area 100 is followed by the data 
areas 101 and 102. In this data area is written logically 
effective data such as video data, still image data, and 
audio data. The logical sector address is ended by the 
lead-out area 103 where reference signals and so on 
are recorded as in the lead-in area. 
[0075] The data area begins with volume information 
area 101 which is management information for use by 
the file system. 

[0076] Data on the disk may be treated as directories 
or files via the file system as shown in FIG. 10(a). 
[0077] All data processed by the DVD recorder is 
placed under the VIDEO_RT directory which is immedi- 
ately under the ROOT directory. 
[0078] The files processed by the DVD recorder are 
classified roughly into two: one management file and a 
plurality of (at least one) AV files. 

(Management file) 

[0079] Next, referring to FIG. IIA, the contents of the 
management information file are described with empha- 
sis on the management information on video. 
[0080] The management information file is classified 
roughly into the VOBI (VOB information) table and the 
PGCI (PGC information) table. A VOB is an MPEG pro- 
gram stream, while a PGC defines the playback order 
of cells in a logical playback unit in a sub-range (or whole 
range) of a VOB. In other words, the VOB is meaningful 
for MPEG, while the PGC is a unit the player plays back. 
[0081] The VOBI table consists of the number of VO- 
BIs (Number_of_VOBIs) and a plurality of VOBIs. Each 
VOBI consists of the corresponding AV file name (AV 
File Name), VOB identifier on the disk (VOBJD), start 
address in the AV file (VOB_Start_Address), end ad- 
dress in the AVfile (VOB_End_Address), VOB playback 
time length (VOB_Playback Time), and stream attribute 
information (VOB_ Attribute). 

[0082] The PGCI table consists of the number of PG- 
CIs (Number_of_PCGIs) and a plurality of PGCIs. Each 
PGI consists of the number of cell! (Cell information) en- 
tries and the cellls. Each celll consists of the playback 
start time in the VOB (Cell_Start_Time), playback time 
in the VOB (CelLPIayback_Ttme), playback start ad- 
dress in the VOB (CeU_Start_Address), and playback 
end address in the VOB (CeILEnd_Address). (AV file) 
[0083] Next, an AV file is described by referring to FIG. 
11B. 

[0084] An AV file consists of a plurality of VOBs, which 
are recorded in the Av file consecutively. It should be 
noted that the AV file sometimes consists of only one 
VOB. The VOBs in the AV file aro managed by the VOB 
information in the above-described management file. 
The player first accesses the management information 
file, reads the VOB start address and end address, and 
then accesses the VOB. Within the VOB are defined 
cells which are logical playback units. A cell is a partial 



playback range (or whole range). This cell allows the 
user to perform simple editing without having to operate 
on actual AV data. As with access information on a VOB, 
access information on a cell is maintained in the man- 

s agement information file. The player first accesses the 
management information file, reads the cell start ad- 
dress and the end address, and then accesses the cell. 
[0085] Cell address information is relative to the start 
of the VOB, and VOB address is relative to the start of 

10 the AV file; therefore, the VOB address is added to the 
cell address to calculate the address in the AVfile before 
the player accesses the AV file. 

(Still image data management information) 

15 

[0086] Next, by referring to FIG. 12, still image data 
management information is described. 
[0087] For still image management information, VOB- 
SIs (VOBS information), instead of VOBIs, are stored in 

20 the VOBI table. Each VOBS is a set of a plurality of 
VOBs, each consisting of a still image and audio data 
synchronizing the still image if any. 
[0088] A VOBSI consists of the corresponding AV file 
name (AV_File_Name), VOBS identifier for identifying a 

2S particular VOBS among a plurality of VOBSs on the disk 
(VOBSJD), start address in the AV file 
(VOBS_Start_Address), end address in the AV file 
(VOBS_End_Address), still image management table 
(Video Table) containing management information on 

30 the still image data in the VOBS, and audio manage- 
ment information table (Audio_Table) containing man- 
agement information on the audio data in the VOBS. 
[0089] The still image management information table 
(Video_Table) consists of at least one entry of still image 

35 management information (Videol), one for each still im- 
age, and information on the number of still image man- 
agement information entries (Nurnber_of_Videols). The 
still image management information (Vtdeol) consists of 
one byte of still image data size information (Size) and 

40 one byte of pointer information (Ptr_to_ Audiol) pointing 
to the audio management information in the audio man- 
agement table (Audio_Table) for the audio information 
to be played back with the still image. 
[0090] The audio management information table 

45 (Audio_Table) contains audio management information 
(Audiol) on each piece of audio data and the number of 
audio management information entries 
(Number_of_Audiols). The audio management informa- 
tion (Audiol) contains 4 bytes of audio data address in- 

so formation (Address), 2 bytes of audio data size informa- 
tion (Size), 2 bytes of audio playback time information 
(Playback_Time), and 1 byte of pointer information 
(Ptr_to_Audiol) pointing to the audio information (Audi- 
ol) within the audio management information table 

55 (Audio_Table) where post-recording audio data is 
stored for use as post-recording audio information when 
post-recording is used. 

[0091] The PGC! table which defines the playback se- 
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quence contains information different from that of video 
on a cell) level. A still image set celll consists of the iden- 
tifier (VOBSJD) of the corresponding VOBS, start VOB 
number in the VOBS (Cel1_Start_Video), and end VOB 
number in the VOBS (CelLEnd_ Video). 
[0092] This configuration allows the cells of a still im- 
age set in a range (from any still image to any still image) 
of the VOBS to be played back. 

[0093] Next, referring to FIG. 13, the link between still 
images and audio data is described. 
[0094] The still image management information (Vid- 
eol) has pointer information (Ptr_Jo_Audiol) pointing to 
the audio management information (Audiol) in the audio 
table (Audio_Table). An insignificant value (=0) in this 
field indicates that the still image has no synchronizing 
audio data to be played back (Video#3 and Video#4). 
Conversely, a significant value, if included in the pointer 
information (Ptr_to_Audiol), indicates that the still image 
has synchronizing audio data to be played back (Vid- 
eo#1 and Video#2). 

[0095] When post-recording data is added and new 
audio data is recorded, pointer information 
(Ptr_to_Audiol) pointing to some other audio manage- 
ment information (Audiol) is created in the audio man- 
agement information (Audiol). As with the pointer infor- 
mation (Ptr_to_Audiol) in the still image management 
information (Vtdeol) described above, a significant val- 
ue in the pointer information (Ptr_to_Audiol) in the audio 
management information (Audiol) indicates that there is 
post-recording audio data (Audio#1 -> Audio#3). 
[0096] Next, the relation between still image manage- 
ment information (Video! )/audio management informa- 
tion (Audiol) and actual data (AV data) in an AV file is 
described. 

[0097] The order of still image management informa- 
tion (Videol) in the still image management information 
table (Video_Table) matches the order in which still im- 
age data was recorded in the AV file. Also, the order of 
audio management information (Audiol) in the audio 
management information table (Audio Table) matches 
the order in which audio data was recorded in the AVfile. 
[0098] Therefore, for a VOBS consisting only of still 
image data with no audio data, the address of a still im- 
age may be calculated simply by adding the still image 
data sizes (Size) recorded in the still image manage- 
ment information (Videol) beginning at the start of the 
VOBS. 

[0099] When audio data is enclosed by still images 
(audio 1 and audio 2), the address generated by adding 
the still image data sizes is compared with the address 
in the audio management information (Audiol). If they 
match, it is found that the audio data is recorded at this 
address and therefore the data size of the audio data is 
added to the address. By repeating this calculation, all 
still image data in the VOBS may be accessed. 
[0100] Next, referring to the flowchart in FIG. 14, ac- 
cess to still images and audio data, recorded on the op- 
tical disk used in the embodiment of the present inven- 
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tton, is described more in detail. 

[0101] First. Add indicating the current address, the 
variable i indicating an entry number in the still image 
management information table (Video Table), and the 
5 variable j indicating an entry number in the audio man- 
agement information table (Audio_Table) are initialized. 

Add = VOB_Start_Address 

i = 1 

j=1 

10 

(step 1) 

[0102] The variable j and the number of audio man- 
agement information entries (Number_of_Audiols) are 
15 compared and if 

j <= Number_of_Audiols 
is satisfied, control is passed to step 3 where the audio 
data and the address are compared. Otherwise, control 
is passed to step 5. 

20 

(step 2) 

[0103] The current address Add and the address in- 
formation in audio management information #j are com- 
25 pared and if 

Add == Audio[j]. Address 
is satisfied, the current address Add is the start address 
of the audio data managed by audio management infor- 
mation #j (Audio #j) and therefore control is passed to 
30 step 4 where the current address is added. If the above 
expression is not satisfied, control is passed to step 5. 

(step 3) 

35 [0104] The audio data size in audio management in- 
formation #j (Audiol #j) is added to the current address 
Add, the variable j is incremented, and control is passed 
back to step 2. 

Add += Audio[j].Size 
40 j++ 

(step 4) 

[0105] If the conditional expression in step 2 or step 
45 4 is not satisfied, it means that the current address Add 
is a still image data address and therefore the still image 
address is determined. 

(step 5) 

so 

[01 06] Next, a check is made to see if there is a pointer 
to audio management information (Audiol). If there is a 
pointer, control is passed to step 7 to search for audio 
data synchronizing with the still image to be played back 
55 with the still image. If there is not such a pointer, control 
is passed to step 10 to play back the still image. 
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(step 6) 

[0107] The audio data to be played back in synchro- 
nization with the still image is assigned temporarily to 
Ptr_to_ Audiol. s 

(step 7) 

[0108] A search is made to see if the audio manage- 
ment information (Audiol) pointed to PTR_to_Audiol 10 
points to another audio management information (Audi- 
ol) entry. If there is a link to another audio management 
information (Audiol) entry, control is passed back to step 
7. 

75 

(step 8) 

[0109] When it is found in step 8 that there is no more 
link to another audio management information (Audiol) 
entry, the audio data to be played back in synchroniza- 2° 
tion with the still image is determined. 

(step 9) 

[0110] The still imago data determined in step 5 and 25 
the audio data determined in step 9, if found, are played 
back. 

(step 10) 

30 

[0111] The variable i is incremented. 
i++ 

(step 11) 

35 

[0112] The variable i is compared with the number of 
still image management information entries 
(Number_of_Videols) and if 

i <= Number_of_Videols 
is satisfied, it indicates that there is still another piece of 40 
still image data to be played back in the still image set 
(VOBS). Control is passed back to step 2 If the above 
expression is not satisfied, the playback of the VOBS 
ends. 

45 

(step 12) 

(VOBSI data size) 

[01 1 3] Next, the management information size for the $o 
still image set used in the embodiment is described. 
[0114] As shown in FIG. 12, management information 
on one still image rcquiros 2 bytes, ono byte for the still 
image size and one byte for the pointer to audio data. 
Thus, even if 65,000 still images are taken, the size is 55 

65,000 x 2 bytes = 130,000 bytes 
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That is, the size is about 1 30KB. As compared with 1 .4 
MB described in the prior art, this size is as small as 
10% of 1.4 MB. 

[01 1 5] When audio data is recorded at the same time, 
adding audio data to all 65,000 still images is unrealistic 
in terms of capacity even on the DVD-RAM which is a 
large-capacity recording medium. 
[0116] If the size of one still image is 50KB, then 

4.7GB - 65,000 x 50KB = 1 45GB 

If each audio data piece is 192kbps and 10 seconds, 
then 

1. 45GB/ 192kbps x 10sec = 6,041 

This means that as many as about 6,000 audio data 
pieces may be recorded. Because each audio data 
management information entry requires 9 bytes, the to- 
tal is calculated as: 

6,000 x 9 bytes = 54,000 bytes 

The total is 1 84KB, which is about 1 3% of the conven- 
tional management information. 

[0117] As a modification of the management method 
described in FIG. 12 to FIG. 14, 4 bytes of the still image 
data address information (Address) may be added to the 
still image management information (Videol) for each 
still image, composed of one byte of size information 
(Size) and one byte of pointer information 
(Ptr_to_Audiol) pointing to the audio management infor- 
mation, shown in FIG. 12. This means that, though the 
data size of the management information on one still im- 
age is increased to 6 bytes as compared with that of the 
above method, access to the still image data becomes 
easier. At this time, when there is no audio data to be 
played back in synchronization with the still image, the 
management information may be reduced to about 29% 
(6/21) of the management information data size (21 
bytes for each still image) of the prior art shown in FIG. 
9. (Block diagram of the DVD recorder) 
[0118] FIG . 1 5 is a block diagram of the DVD recorder 
used in the embodiment of the present invention. 
[0119] In the figure, the reference numeral 1501 is a 
user interface unit displaying information to, and receiv- 
ing a request from, the user, 1502 is a system controller 
performing overall management and control, 1503 is an 
input unit consisting of a camera and a microphone, 
1504 is an encoder consisting of a video encoder, an 
audio encoder, and a system encoder, 1 505 is an output 
unit consisting of a monitor and a speaker, 1506 is a 
decoder consisting of a system decoder, an audio de- 
coder, and a video decoder, 1507 is a track buffer, and 
1508 is a drive. The system controller 1502 is a micro- 
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computer or some other digital signal processor. The 
system controller 1502 controls access to the optical 
disk as directed by the program whose flowchart is 
shown in FIG. 14. 

[0120] First, the recording operation of the DVD re- s 
corder is described by referring to FIG. 15. 
[0121] First, the user interface unit 1501 receives a 
request from the user. The user interface unit 1501 
sends the user request to the system controller 1502, 
and the system controller 1502 interprets the user re- 10 
quest and makes a processing request to each module. 
When the user request is to take and record a still image, 
the system controller 1502 requests the encoder 1504 
to encode one video frame and audio data. 
[01 22] The encoder 1 504 video-encodes and system- *5 
encodes one video frame sent from the input unit 1503 
and sends the result to the track buffer 1507. 
[0123] Next, the encoder 1504 tells the system con- 
troller 1502 that the still image data has been created. 
The system controller 1502 records the still image data 20 
stored in the track buffer 1507 onto the DVD-RAM disk 
via the drive 1508. 

[0124] After encoding the video data, the encoder 
1504 immediately starts audio-encodes the audio data 
sent from the input unit 1 503 and sequentially transfers % s 
the generated audio data to the track buffer 1507. 
[0125] The encoder 1504 tells the system controller 
1 502 that audio encoding has been started. The system 
controller 1502 sequentially records the audio data 
stored in the track buffer 1 507 onto the DVD-RAM disk 30 
via the drive 1508. 

[0126] A stop request from the user is sent to the sys- 
tem controller 1502 via the user interface unit 1501. The 
system controller 1 502 sends the recording stop instruc- 
tion to the encoder 1504. The encoder 1504 ends en- 35 
coding after the immediately-following audio frame is 
encoded, transfers all audio data to the track buffer 
1 507, and tells the system controller 1 502 that encoding 
has finished. The system controller 1502 records all re- 
maining audio data to the DVD-RAM disk via the drive *o 
1508. 

[0127] After finishing the above operation, the system 
controller 1502 creates the above -described VOBSIs 
and cellls and records them on the DVD-RAM disk via 
the drive 1 508. Al this time, it is important that link infor- 45 
mation (Ptr_to_Audiol) pointing to the audio manage- 
ment information (Audiol) in the still image management 
information (Videol) is generated so that it points to the 
audio management information (Audiol) of audio data 
recorded at the same time. so 
[0128] When the user continuously records still imag- 
es and audio data as described above, one VOBS is 
created. The VOBS is a unit in the data structure and, 
at the same time, a block of still images taken continu- 
ously by the user at the same time. A plurality of VOBSs ss 
can be created within one recording medium. 
[0129] Next, the playback operation of the DVD re- 
corder is described with reference to FIG. 15. 
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[0130] First, the user interface unit 1501 receives a 
request from the user. The user interface unit 1501 
sends the user request to the system controller 1502, 
and the system controller 1502 interprets the user re- 
quest and makes a processing request to each module. 
When the user request is to play back a PGC pointing 
to a still image set (VOBS), the system controller 1502 
reads the PGC information (PGCI) via the drive 1508 
and, from VOBSJD described in the cell information 
(Celll) of the PGCI that was read, reads the VOBS in- 
formation (VOBSI). 

[0131] Next, according to the flowchart described in 
FIG. 1 4, the system controller 1 502 checks the address 
of the still image data to be played back, checks if there 
is audio data to be played back in synchronization with 
the still image data, and finds the audio data. 
[01 32] Next, the system controller 1 502 asks the drive 
1 508 to read the still image data first and then audio data 
(if any) from the DVD-RAM disk and to store them into 
the drive 1508. 

[0133] Then, the system controller 1502 issues a de- 
code request to the decoder 1506. The decoder 1506 
reads AV data from the track buffer 1507 and decodes 
it. The decoded data is displayed on the monitor, or out- 
put from the speaker, via the output unit 1505. 
[0134] In this embodiment, an example of DVD-RAM 
is described. The present invention is not limited only to 
a DVD-RAM or an optical disk but applies to other media 
too. Other media include random access recording me- 
dia such as a magneto-optical disk, magnetic disk, and 
semiconductor memory. 

[0135] In the embodiment, still image data VOBs and 
audio data VOBs are recorded in an AV file separately 
from other VOBs. They may also be recorded in the AV 
file in which other VOBs are recorded. The present in- 
vention is not limited by the configuration of an AV file. 
[01 36] In the embodiment, the order of audio manage- 
ment information (Audiol) entries in the audio manage- 
ment information table (Audio_Table) matches the order 
in which data is recorded in the AV file. In essence, the 
order is not limited. However, when the order of audio 
management information (Audiol) entries do not match 
the order in which data is recorded in the AV file, the 
search for the audio management information (Audiol) 
is not narrowed down to one entry and therefore all au- 
dio management information (Audiol) must be 
searched. 

[0137] In this embodiment, all still images and all au- 
dio data managed by a VOBSI are recorded in an area 
in the AV file beginning at VOBS_Start_Address and 
ending at VOBS_End_Address. However, audio data, 
especially audio data recorded through post-recording, 
need not be recorded in this range but may be recorded 
in any position within the AV file as long as it is not in- 
cluded in a recording area (from VOBS_Start_Address 
to VOBS_End_Address) managed by some other 
VOBS. 

[0138] In addition, a one-bit playback identification 
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flag (Playback Permission), if provided in still image 
management information (Videol) to indicate the play- 
back enable/disable option as shown in FIG. 16, ena- 
bles the user to specify a still image not to be played 
back., that is, a still image to be skipped. This ability al- 
lows the user to play back only the selected still images 
from a large number of still images that were taken. 
[0139] In FIG. 12, an address is represented by 4 
bytes. The address may be represented by 3 bytes be- 
cause the maximum number of sectors (2046 bytes) on 
a 4.7GB disk is 2,464,153 (=4.7x1024x1024x1024/ 
2048) and therefore all sector addresses on the disk 
may be represented by at least 22 bits. 
[0140] The embodiment of the present invention is an 
optical disk on which at least still image data is recorded, 
the optical disk comprising still image set management 
information (VOBSI) managing synthetically a plurality 
of still image data pieces as one still image set and a 
variable-size still image management information table 
(Video_TabJe) proportional to a number of still images 
managed by the still image set management information 
(VOBSI). When audio data to be played back in syn- 
chronization with the still images is recorded, the optical 
disk further comprises, a variable-size audio manage- 
ment information table (Audio_Table) proportional, in 
size, to the number of the audio data pieces to be played 
back in synchronization with the still images in the still 
image set. The still image management information ta- 
ble (Video_Table) comprises at (east one still image 
management information (Videol) entry composed of a 
still image data size and pointer information 
(Ptr_to_Audiol) pointing to the audio management infor- 
mation (Audiol) to be played back in synchronization 
with the still image. 

[01 41] As a result, the present invention compresses 
the management information on the still images and au- 
dio data, reducing them to a little larger than 10% of that 
used in the conventional configuration. 
[0142] The audio management information table 
(Audio_Table) comprises at least one audio manage- 
ment information (Audiol) entry composed of an audio 
data address, an audio data size, an audio playback 
time, and, when post-recording is used, pointer informa- 
tion (Ptr_to_Audiol) having a link to other audio man- 
agement information (Audiol). Therefore, the present in- 
vention allows the user to perform post-recording with- 
out losing the original audio management information. 
[0143] For each still image in the still image set, a 
playback identification flag (Playback_ Permission), 
which indicates whether or not the still image is to be 
displayed during playback, is provided in the still image 
management information (Videol). Therefore, the 
present invention allows the user to specify that unnec- 
essary still images be skipped during playback. 
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Claims 

1. A recording medium comprising; 

a still image data area (102) capable of storing 
a plurality of still image data (VOB) pieces 
therein; and 

an area (102) storing still image set manage- 
ment information (VOBSI) therein for managing 
synthetically a part or whole of still image data 
(VOB) in said still image data area as one still 
image set (VOBS), 

said still image set (VOBS) having said corre- 
sponding still image set management informa- 
tion (VOBSI). 

2. A recording medium according to claim 1 further 
comprising: 

an area (102) capable of recording audio data 
therein, wherein said still image set manage- 
ment information (VOBSI) comprises still im- 
age management information (Videol) for each 
still image, wherein said still image manage- 
ment information (Videol) comprises a data 
size of said still image and information 
(Ptr_to_ Audiol) indicating whether or not the 
audio data to be played back in synchronization 
with said still image is recorded. 

3. A recording medium according to claim 1 further 
comprising: 

an area (102) recording therein audio data to 
be played back in synchronization with said still 
image, wherein said still image set manage- 
ment information (VOBSI) comprises audio 
management information (Audiol) comprising a 
data size of said audio data and a playback time 
of the audio data. 

4. A recording medium according to claim 1 further 
comprising: 

an post -recording area (1 02) capable ol record- 
ing therein audio data after said still image is 
recorded, wherein said still image set manage- 
ment information (VOBSI) comprises pointer 
information (Ptr_to_Audiol) specifying the area 
of audio management information managing 
the audio data recorded in said post-recording 
area. 

5. A recording medium according to claim 2, wherein 
said still image management information (Videol) 
comprises, for each still image, a playback identifi- 
cation flag (Playback_Permission) specifying 
whether or not the still image will be displayed dur- 
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ing playback. 

6. A recording medium according to any of claims 1 to 
5, wherein said recording medium is an optical disk 
to and from which data may be written and read. 

7. A recording unit recording data on the recording 
medium according to claim 1 , said recording unit 
comprising: 

recording means (1508) for recording the still 
image data in said still image data area (102) 
on said recording medium; and 
means (1502) for creating still image manage- 
ment information (Videol) when said still image 
data is recorded, said stili image management 
information (Videol) managing said still image 
data, wherein said still image management in- 
formation (Videol) is stored in the area (1 02) by 
said recording means (1508), said area con- 
taining said still image set management infor- 
mation (VOBS!) on said recording medium. 

8. A recording unit according to claim 7, further com- 
prising: 

recording means (1 508) for recording audio da- 
ta on said recording medium; and 
means (1502) for creating audio management 
information (Audiol) when the audio data to be 
played back in synchronization with said still 
image data is recorded, said audio manage- 
ment information managing said audio data, 
wherein said audio management information 
(Audiol) is written into the area by said record- 
ing means (1 508), said area containing said still 
image set management information (VOBSI). 

9. A recording unit according to claim 7, further com- 
prising: 

recording means (1 508) for recording audio da- 
ta on said recording medium; 
means (1502) for creating audio management 
information (Audiol) when the audio data is re- 
corded after said still image is recorded, said 
audio management information managing said 
added audio data; and 

means for creating pointer information 
(Rr_to_Audiol) specifying the area containing 
the audio management information (Audiol) 
managing the added audio data, wherein said 
pointer information (Ptr_to_Audiol) is written in- 
to the area by said recording means (1508), 
said area containing said still image set man- 
agement information (VOBSI). 

10. A recording unit according to claim 7, further com- 
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prising means (1502) for setting, for each still im- 
age, a playback identification flag 
(Playback_Permission) into said still image man- 
agement information (Videol), said playback identi- 
5 fication flag (Playback_Permission) specifying 

whether or not said still image will be displayed dur- 
ing playback. 

11. A recording unit according to any of claims 7 to 10, 
io wherein said recording medium is an optical disk 

and said recording unit is an optical disk recorder 
capable of writing the data onto said optical disk. 

12. A playback unit playing back data recorded on the 
is recording medium according to claim 1 , said play- 
back unit comprising: 

means (1 502) for identifying the area where the 
still image to be played back is recorded based 

20 on information recorded in the still image man- 

agement information (Vtdeol) in said still image 
set management information (VOBSI); and 
means (1508) for accessing the area where 
said still image data is recorded on said record- 

2S jng medium and for reading the data therefrom. 

13. A playback unit playing back data recorded on the 
recording medium according to claim 3 : said play- 
back unit comprising: 

30 

means (1 502) for identifying the area where the 
still image to be played back is recorded based 
on information recorded in the still image man- 
agement information (Videol) in said still image 

35 set management information (VOBSI); 

means (1508) for accessing the area where 
said still image data is recorded on said record- 
ing medium and for reading the data therefrom; 
means (1502), in a case that an information 

40 (Ptr_to_ Audio) indicating that recording of an 

audio data to be played back in synchronization 
with said still image when said still image is 
played back, for determining an area for ac- 
cessing the audio data to be reproduced on the 

45 basis of said audio management information of 

the audio data; and 

means (1505) for playing back said still image 
and said audio data. 

so 14. A playback unit playing back data recorded on the 
recording medium according to claim 5, comprising: 

means (1 502) for identifying the area where the 
still image to be played back is recorded based 
55 on information recorded in the still image man- 

agement information (Videol) in said still image 
set management information (VOBSI); 
means for accessing the area where said still 
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image data is recorded on said recording me- 
dium and for reading the data therefrom; and 
means (1502) for preventing said still image 
from being played back when said playback 
identification flag (Playback_Permission) in- 
cluded in said still image management informa- 
tion (Videol) indicates a playback disable sta- 
tus. 

15. A playback unit according to any one of claims 12 
to 14, wherein said recording medium is an optical 
disk and said playback unit is an optical disk player 
capable of reading data from said optical disk. 
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